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Abstract 

The maximum size of a binary code is studied as a function of its length n, minimum distance d, and minimum 
codeword weight w. This function B(n, d, w) is first characterized in terms of its exponential growth rate in the 
limit n — > oo for fixed 5 = d/n and uj = w/n. The exponential growth rate of B(n,d,w) is shown to be equal to 
the exponential growth rate of A(n, d) for < ui < 1/2, and equal to the exponential growth rate of A(n,d,w) 
for 1/2 < u> < 1. Second, analytic and numerical upper bounds on B(n,d,w) are derived using the semidefinite 
programming (SDP) method. These bounds yield a non-asymptotic improvement of the second Johnson bound and 
are tight for certain values of the parameters. 

Index Terms 

Constant weight codes, Johnson bounds, semidefinite programming 

I. Introduction 

Two classical functions in combinatorial coding theory are A(n,d), the largest size of a binary code 
of length n and minimum distance d, and A(n, d, w), the largest size of a binary code of length n, 
O ! minimum distance d, and constant weight w. A closely related function is B(n,d, w), obtained from 
A(n,d, w) by relaxing the weight constraint to only require that the weight of each codeword is at least 
w. Codes satisfying a minimum weight constraint are called heavy weight codes in 0, where they are 
motivated by certain asynchronous communication problems. The other relaxation where codewords are 
i/-) \ required to have weight at most w defines the function L(n, d, w). Complementation immediately shows 
■ that L(n,d,w) = B(n,d,n — w). The function L naturally occurs in the proof of the Elias bound |fT51 
Lemma 2.5.1]. It also occurs in the problem of list decoding when bounding the size of the list as a 
function of the decoding radius w. In this problem, L(n,d,w) represents the largest size of a list of 
O [ codewords at distance at most w from the received vector, given a binary code of length n and minimum 
distance d. This function is denoted by A' 2 (n,d,w) in Q~4l, where the Elias Lemma ITT31 Lemma 2.5.1] 
is referred to as the Johnson bound, and is used to prove upper bounds on the list size. 

In the present paper we first characterize the asymptotic exponent of B(n, d, w) as a function of those 
5^ \ of A(n, d) and A(n, d, w) (Theorem [T]). This result is based on the asymptotic unimodality of A(n, d, w), 
which was conjectured in Q Conjecture 2]. Note that, the non asymptotic analogue of this result (posed 
as a research problem in [[161 P-674]) is false as A(15, 6, 6) < A(15, 6, 7) lfT9l . 

Second, we provide upper bounds on L(n,d,w) obtained by the semidefinite programming method. 
From these bounds, we derive a non asymptotic improvement of the Elias/Johnson Lemma in a certain 
range of n, d, and w (Theorem [3]) as well as numerical tables. 

The material is organized as follows. Section [II] contains elementary bounds and some tables of 
B(n,d,w) derived therefrom. Section ITTT1 contains the asymptotic results. Section |TVl is dedicated to the 
SDP method. Section [V] explores three heavy weight codes construction techniques. In Section [VI] we 
provide some concluding remarks. 
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II. Elementary bounds 

In this section we establish a few basic relations between B(n, d, w) and A(n,d,w). 

Note first that B(n,d,w) is increasing in n, and decreasing in d and w. Further, by definition of 

B(n, d, w), we have 

B{n, d, w) > A{n, d,j) forj>w. (1) 
By taking weight classes sufficiently far apart so that they do not overlap, we get 

B(n,d,w)> ^2 A(n,d,w + hd) (2) 

h=0 

where [x\ denotes the largest integer not exceeding x. 

Since any code is a disjoint union of constant weight codes, we have 

n 

B(n,d,w) <^A{n,d,j). (3) 

j=w 

Removing the weight constraint can only improve the size, hence 

B(n,d,w) < A(n,d) = B(n,d,0) . (4) 



The following result is analogous to the first half of the first Johnson bound (3a)] ill 
Proposition 1: For w < n we have 

B{n, d, w) < -B(n - 1, d, w - 1) . 
w 

Proof: Let C be a code realizing B(n, d, w), and consider the matrix whose rows are its codewords. 
Since the average weight of a column, which we denote by W, is given by the total number of l's in the 
matrix divided by n, we get 

w ^wB(n 1 d 1 w 1 _ 
n 

Now, say column I has weight at least W (one such column clearly exists). Pick the subcode of C given by 
the codewords of C that have a 1 in the /-th position. Modify this subcode by deleting the Z-fh component 
of each codeword. If we denote by C the resulting code, we conclude that W < \C'\ < B(n— 1, d, w — 1). 
Using this together with © yields the desired result. ■ 

Finally, the following Gilbert type lower bound is immediate: 

Proposition 2: For all n > 1, d < n, and w < n 



B(n,d,w) > ,™ \ l { 
2^/1=0 U) 



We conclude this section with tables derived from the preceding bounds. Some trivial entries are 
B(n, d, w) = 1 whenever d > min{2w, 2{n — w)}. We limited n and d to the values where A(n, d) and 
A(n, d, w) are known exactly (for all w) in O, jU. Entries of the tables where w > n are left blank. 

'Whether or not the analogous of the second half of the first Johnson bound, i.e. J6] (36)], holds as well remains an open question. 
Specifically, it is unclear at this point whether the inequality 

n 

B(n, d, w) < B(n — 1, d, w) 



is valid. 
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TABLE I: B(n,4,w) 



n 


A(nA) 


w = 2 


w = 3 


w = 4 


w = 5 


w = 6 


w = 7 


w = 8 


vv = 9 


6 


4 


4 


3-4 




3 -4 


1 


1 








7 


8 


8 


7-8 




7-8 


3-5 


1 


1 






8 


16 


16 


15 -16 




15-16 


8-10 


4 -6 


1 


1 




9 


20 


20 


19-20 




19-20 


18-20 


12-18 


4-6 


1 




10 


40 


40 


39- 40 


39-40 


36-40 


30-40 


13-20 


5-7 


1 












TABLE II: B(n,6, 


w) 








n 


A(n,6) 


w = 2 


w = 3 


w = 4 


w = 5 


w = 6 


w = 7 


w = 8 


w = 9 


9 


4 


4 


4 




3- 4 


3- 4 


3-4 


1 


1 


1 


10 


6 


6 


6 




6 


6 


5- 6 


3- 6 


1 


1 


11 


12 


12 


12 




11-12 


11-12 


11- 12 


6- 9 


3- 6 


1 


12 


24 


24 


24 




23- 24 


23-24 


23-24 


12- 24 


9-16 


4- 7 


13 


32 


32 


32 




31-32 


31- 32 


31- 32 


26-32 


18-32 


13- 20 



III. ASYMPTOTICS 

For fixed 5, to G [0, 1], we denote by b(6,u) the exponential growth rate of B(n,d,w) with respect to 
n with d = d{n) = [5n\ and w = w(n) = [^^J , i- e - 

b(5, uS) = limsup (— log B(n, d(n),w(n)) 

where logarithms are taken to the base 2 throughout the paper. The asymptotic exponents of A(n, d, w) 
and A(n, d) are defined similarly and are denoted by a(6,u) and a(8), respectively. 
Proposition 3: For any 5 G [0, 1] and w 6 [0, 1/2], we have b{8, oj ) = a(<5). 
Proof: The Elias-Bassalygo bound lfT8l equation (2.8)] 

A(n,d) A(n,d,w) 

together with the trivial inequality A(n, d, w) < A(n,d) shows that the asymptotic exponents of A(n, d) 
and A(n, d, n/2) are the same. The result then follows by combining the bounds (OQ) and © to obtain 

A{n,d,n/2) < B(n,d,w) < A{n,d) 

for w < n/2. ■ 
The next result provides the main ingredient for proving that b(S,u) = a(6,u) when to G (1/2, 1]. 
Theorem 1: For fixed 5 G [0, 1], a(S,uj) is unimodal in u with a maximum at u = 1/2. 
Corollary 1: For any 5 G [0, 1] and u G (1/2, 1], we have b(S, u) = a(6,u). 
Proof of Corollary {J} We have 

max A(n, d, j) < B(n, d, w) < (n — w+ 1) max A(n,d,j) (7) 

j(z{w,w-\-l,...,n} j£{h'.h'+1,...,ii} 

by for the first inequality and by © for the second inequality. Letting w = [^^J and d = [6n\ we 
get 

max A(n, d, j) = max A(n,d, \jm\) , (8) 

j£{w,w+l,...,n} p€[id,l] 
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TABLE III: B(n,8,w) 



n 


A(n,8) 


w = 2 


w = 3 


w = 4 


w = 5 


w = 6 


w = 7 


w = 8 


w = 9 


12 


4 


4 


4 


4 


4 


4 


3- 4 


3- 4 


1 


13 


4 


4 


4 


4 


4 


4 


4 


3-4 


3-4 


14 


8 


8 


8 


8 


8 


8 


8 


7- 8 


4- 8 


15 


16 


16 


16 


16 


15-16 


15-16 


15-16 


15-16 


10- 16 



and therefore from © we have 

b(S, uj) = sup a(S, p) 

w<p<l 

for any 5 G [0, 1] and a; G [0, 1]. Assuming that 1/2 < < 1, the theorem then follows from Theorem [Q 

■ 

Proof of Theorem^ : We establish that a(S, •) is non-decreasing over [0, 1/2]. This, by complemen- 
tation, shows that a(5, •) is non-increasing over [1/2, 1], proving the claim. 

Fix 5 G [0, 1] and let u)±, U2 be such that < u\ < CO2 < 1/2. Throughout the proof we disregard 
discrepancies due to the rounding of non-integer quantities as they play no role asymptotically. Thus, for 
instance, we shall always treat oj\n as if it is an integer. 

We show that, from a given constant weight code C\ with parameters (n, d = 5n, Wi = coin) such that 
\Ci\ = A(n, d, Wi), it is possible to construct a constant weight code C2 with parameters (n, d, W2 = ^n), 
of size at least equal to \C\\ multiplied by l/(n+ l) 2 . This shows that a(6,U2) > a(5,ui). The code C2 
is obtained from C x via translation. 

For a given fixed codeword c G Ci, let us construct a length n binary vector t of weight 

co 2 — wi 

w = wn = n 

1 - 2^i 

as follows. Consider first the positions of t that form the support of c (wi of them). Pick UiW of these 
positions arbitrarily and assign them l's. Similarly, assign l's to an arbitrary selection of the (1 — l>i)w 
positions that lie outside the support of c. The remaining positions of t are filled with 0's. Note that, by 
of our choice of w, the vector c' = t © c (component wise modulo 2 sum of t and c) has weight vv 2 . 

Now observe that, because the selections made to construct t are arbitrary, for any given c G C\ there 
are 

/ uj\n \ / (1 — uji)n 
\Uiun) \(1 — ujijuin^ 

ways of choosing t for which c' has weight w>2. Therefore, if we now pick t randomly and uniformly 
among all possible sequences of weight w, the probability that this sequence translates a given c G C\ to 
a sequence of weight w 2 is given by 

/ win \ / (1— uii)n \ 
\LUiwnJ \(l—Lai)umJ 

P = ■ 

\um/ 

This implies that a vector t that is randomly and uniformly chosen among all possible sequences of weight 
w translates on average 

pA(n, d, w\) 

codewords from C\ into codewords of weight w 2 (and minimum distance d). Therefore, 

A(n, d, w 2 ) > pA(n, d, wi) . 
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Finally, using the following standard bounds on binomial coefficients^ 



2 nh(k/n) < (n\ < 2 nh(k/n) fe < n 



(n + 1) ~~ \fc 

(see, e.g.. [fTTl Example 11.1.3, p. 353]) shows that 

1 

P> 



(n + 1) 2 

Therefore we obtain 



A(n,d,w 2 ) > t — l -—^A{n,d,w l ) 



from which the theorem follows. 



IV. Upper bounds on L(n,d,w) from semidefinite programming 

The semidefinite programming method is a far reaching generalization of Delsarte linear programming 
method to obtain bounds for extremal problems in coding theory. In the present situation, we aim at 
upper bounding L(n,d,w), which is the maximal number of elements of a code contained in the ball 
B{w) centered at the all-zero word with radius w of the binary Hamming space H n = {0, l} n . We obtain 
numerical bounds for small values of the parameters (n,d,w), which improve the elementary bounds for 
B(n, d,n — w) = L(n, d, w) given in Section II. We also obtain a new bound, which is an explicit function 
of (n,d,w), and improves on the Elias/Johnson bound for some values of these parameters. 

The numerical bounds are obtained by a straightforward application of the SDP method. We refer 
to for a survey of this method and its applications to the binary Hamming space, including the 
case of codes in balls. See also (31 for a survey on the more general subject of symmetry reduction of 
semidefinite programs, with applications to coding theory. In a few words, L(n,d,w) can be interpreted 
as the independence number of a certain graph with vertex set H n , thus is upper bounded by the so-called 
Lovdsz theta number i3 of this graph (or rather by its strengthening •&'), which is the optimal value of a 
certain semidefinite program. This SDP has exponential size, but can be reduced to polynomial size by 
the action of the symmetry group of the graph, which is the symmetry group of B(w), i.e. the group S n 
of permutations of the n coordinates. 

Let us recall that a function F : H% 1 — >• R. is said to be positive definite (or positive semidefinite) if 
the matrix (F(x,y)) indexed by H n is positive semidefinite. This property is denoted F >z 0. In the 
symmetrization process discussed above, a description of the ^-invariant positive definite functions on 
H n is required. This description is in fact provided in [|20l . under the name of block diagonalization of 
the Terwilliger algebra of the Hamming space, and in the framework of group representations in Il22ll . 
Numerical upper bounds for L(n, d, w) obtained in this way are displayed in Tables IV, V, VI. 

For the announced explicit bound, we use a slightly different (and self contained) formulation of the 
SDP bound, which is given in Theorem [2l We shall recover the Elias/Johnson bound as a special case, 
and obtain a new bound in Theorem [3] There, we follow the same line for Hamming balls as the one 
followed for spherical caps in 01. In the latter, the SDP method has lead to numerical bounds and also 
to explicit bounds of degree up to two. 



A. Improving the Johnson bound 

We start with a more handy restatement of the SDP bound, which is essentially the dual form of the 
SDP defining the theta number •&' . The notations are as follows: the space of functions on H n is denoted 
C(H n ) = {/ : H n C} and is endowed with the standard inner product (fi, f 2 ) = Y^xeH fi{ x )f^( x )- 
We shall consider the decomposition of this space under the action of the full automorphism group 

2 h(p) denotes the binary entropy — plogp — (1 — p) log(l — p). 
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Aut(if n ) of the Hamming space and under the action of the symmetric group S n . Since the irreducible 
components are indeed real, we can restrict to the real valued functions. 

The orbit of (x, y) G H 2 under the action of S n is determined uniquely by the values of u := wt(x), 
v := wt(y) and t := d(x,y). Thus the elements of F G C(H 2 ) which are ^-invariant, i.e. which satisfy 
F(gx,gy) = F(x,y) for all g G S n , (x,y) G H 2 , are of the form F = F(u,v,t). With this notation, 
F >z stands for: (x,y) \-¥ F(wt(x),wt(y),d(x,y)) >z 0. 

Theorem 2: Let 

Q(n, d, w) := {(u, v, t) G N 3 : < u, v < w, d <t < n, 
t < u + v, u + v — t = mod 2}. 

Let P(u,v,t) G M[u, v,t] be a polynomial symmetric in (u,v). If P satisfies the following conditions: 

1) P- fohO for some f > 

2) P(u, v, t) < for all (u, v, t) G Q(n, d, w), 

3) P(u, u, 0) < 1 for all u G {0, . . . , w}, 
then 

L(n,d,w) < —. 

Jo 

Proof: For G i?^, let F(x,y) := P (wt(x), wt(y), d(x,y)). We consider for a code C C 

with minimal distance at least equal to d, the sum 

s-.= Yl F ^y)- 

From property (1) of P, we have S > fo\C\ 2 . On the other hand, S = Si + S2 where S\ is the sum over 
pairs (x, y) G C 2 with x = y and S 2 is the sum over the non equal pairs (x, y) G C 2 , x 7^ y. Condition 
2) on P insures that S2 < and condition 3) on P that 5*i < |C|. Altogether we obtain |C| < l//o- ■ 

In order to apply the above theorem with specific polynomials P(w, v, t), we need an explicit description 
of those who are positive definite. Such a description is indeed obtained in EOl . and in [|22l in terms of 
orthogonal polynomials (Hahn polynomials to be precise). As we shall see, for our purpose, we need a 
slightly different expression. 

A general method is explained in fl}, ED, 0, involving group representation. The space C(H n ) can 
be decomposed into the direct sum of S n -irreducible subspaces. The sum of those subspaces which are 
isomorphic to a given irreducible representation of S n is called an isotypic subspace. We recall that certain 
matrices Ek(x,y) are associated to the isotypic components Ik of C(H n ) under the action of S n . Here 
k G [0.. L ri /2J], F-k corresponds to the irreducible representation [n — k, k] of the symmetric group S n , and 
has multiplicity n — 2k + l. Moreover, Ek(x, y) is ^-invariant thus can be expressed in terms of (u, v, t), 
namely E k (x, y) := Y k (u, v, t). Then we have the following characterization (we use the standard notation 
(A, B) = Trace(AP*) for matrices): 

Proposition 4: For all P G M.[u, v, t], symmetric in (u, v ), P >z if and only if 

P(u,v,t)= J2{ F k,E k {z,y)) (9) 

fc=0 

where for k G [0..|n/2j], F k G ]R m fe xm fe, m k = n - 2k + 1, and F k y 0. More precisely, E k (x,y) 
is computed from a decomposition of X k into irreducible subspaces X k = R kj i © . . . Rk,m k - If for all i, 
(e kjij i, . . . ,e]s i fi k ) is an orthonormal basis of R kji in which the action of S n is expressed by the same 
matrices (i.e., not depending on i), then 

E k ,i,j(x,y) = y^ j e k>i>s (x)e k>j>s (y). 

s=l 
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The decomposition of X k with irreducible submodules is not unique but changes E k (x, y) to AE k (x, y)A* 
for an invertible matrix A, see 01 Lemma 4.2]. Note that such a change does not affect the above 
characterization of P being positive definite since (F k , AE k (x, y)A*) = (A*F k A, E k (x, y)) and F k y if 
and only if A*F k A y 0. 

There are essentially two strategies to obtain such a decomposition. One can start from the decom- 
position of X = H n into orbits under the action of S n , namely X = X U ■ • ■ U X n , with X k = {x E 
H n : wt(x) = k}, which leads to a decomposition of the functional space C(X) = C(X ) _L • • • _L C(X n ) 
and then decompose each SVrspace C(X k ), following [TT2l . It is the method adopted in 11221 where the 
corresponding matrices E k (x,y) are obtained in terms of Hahn polynomials. Another approach starts 
from the decomposition of C(H n ) under the full Aut(H n ), namely C(H n ) — P J_ P\ _L ■ • ■ J_ P n where 
Pk = ffiwt(w)=fcCxw, Xw(x) = (— l) w ' x , then decomposes each P k under the action of the subgroup S n . 
Because we want to work with polynomials in (u,v,t) of low degree, this last decomposition is better 
suited. Indeed, if P G R[u, v, t], then x >->■ F(x, y) := P(wt(x), wt(y), d(x, y)) belongs to P _L • • ■ ± P k 
if and only if the total degree of P in the variables (u, t) is at most equal to k. 

An isomorphism of ^-modules between C(X k ) and P k is given by <p k : 

<P k : C{X k ) -)• P k 

f ^ Mf) ■= f( w )^w 

wt(w)=fe 

so we have exactly the same picture for the decomposition of C(H n ) when P k replaces C(X k ), namely 
the irreducible decomposition of P k under the action of S n that is for < k < [§ J , we have 

P k = H 0>k 1 H ltk ±---±H kik (10) 

and the isotypic components of C(H n ), i.e. 

— H k k _L H k>k+ i ± • ■ ■ _L H k n _ k ~ H^ k 2k+l . 

Since n = wt(x), as a function of x, is invariant under and is of degree 1, the isotypic subspace X k 
can also be decomposed as: 

Moreover, starting from an orthonormal basis (e ktS ) of H k ^ k , we obtain an orthonormal basis (u l e kjS ) of 
u l H k:k in which the action of S n is expressed by the same matrices, thus we can use it to compute the 
corresponding matrix E k (x,y) the coefficients of which will be equal to: 

h k 

E k)itj (x,y) = ?iV^e M (i)e tiS (?/). 

s=l 

In other words, it is enough to compute Z k (x,y) := Yl s =i e k,s(x)e k ^ s (y), which is the zonal function 
associated to H k>k , in terms of (u,v,t). We obtain: 

Proposition 5: We have the following expressions for Z k , up to a positive multiplicative constant: 

• Z = l 

• Z\ = — t + u + v — 2uv / n 

. Z 2 = t 2 + (2/(72 - 2))(n -nu-nv + 2uv)t + (l/(n - l)(n - 2))(Au 2 v 2 - An(u 2 v + uv 2 ) + (n + 
2)(n - l)(n 2 + v 2 ) + 2n(n + - 2n(n - \){u + v)) 

Proof: We take the following notations: if wt(vv) = 1, and Wi = 1, we let x% '■= Xw- Let 

U :=n-2u = Y^ =1 Xi(^), 
V :=n-2v = YJl=iXi{v)i 
T := n — 2t = Xi(«)Xi(2/)- 



s 



Following lfT2ll . and the isomorphism (fi k defined above, H kk = ker(d) where d : P k — >■ Pk-i is defined 
by: dxw = J^Xw' where the sum is over the words w' of weight wt(w') = wt(w) — 1, and of support 
contained in the support of w. We set d — d x to specify the variable under consideration and d = d x + d y 
when applied to a function F(x, y) on H 2 . Then, Z k is uniquely determined up to a multiplicative constant 
by the properties: 

1) Z k e R[U, V, T], is symmetric in (U, V), 

2) Z k (x,y) belongs to P k , 

3) dZ k = 0. 

According to the decomposition (flOl ) with pairwise non isomorphic irreducible subspaces, the space of 
functions satisfying conditions (1) and (2) below is of dimension 1 + k. In the variable x, U and T belong 
to Pi, and it is easy to check that U 2 — n, UT — V,T 2 — n, belong to P^. Thus a basis for the space of 
functions satisfying (1) and (2) is given by: 



(k 

k 
k 



{1} 

{UV,T} 
{(U 2 -n)(V 2 



n) 



UVT -U 2 -V 2 + n,T 2 - n} 

The assertion Z = 1 is then trivial. In order to compute Zi and Z 2 , we need formulas for the image 
under d of the monomials in (U, V,T). We compute the following: 




d(UV) = 
dT = U 



n(U + V), 



With the above we obtain that Z\ is proportional to T — -UV. Similarly we obtain: 

r d(U 2 + V 2 ) = 2(n-l)(U + V), 
d{U 2 V 2 ) =2{n-l)(U 2 V 



d(UVT) = (U 2 V + UV 2 ) + 



{d(T 2 ) = 
and Z 2 turns to be proportional to 

rj~i2 



-2(U + V) + 2(U 



UV 2 ), 
n-2)(U 
V)T. 



V)T, 



n 



n 
1 



■{UVT- U 2 -V 2 + n) 



■{U 2 -n){V 2 -n). 



n - l)(n - 2) 

2 , we have that Z k (U,U,0) > which determines the sign of the 



From the identity Z k (x,x) = ^e k}S (x 
multiplicative factor. We obtain the announced formulas. ■ 
Remark: The method used to calculate the polynomials Z k for < k < 2 outlines an algorithmic way to 
compute Z k for general k. It would be more satisfactory to have an expression of these polynomials in 
terms of orthogonal polynomials. 

Now we apply Theorem [2] in order to obtain upper bounds for L(n,d,w). We start with a polynomial 
P(u, v, t) of degree one and recover Elias bound: Let 



P(u, v, t) :=Zi(u, v,t) + d ■ 
—d — t + (u + v 



2w(l — w/n) 
2uv/n) - 2w{\ 



w/n). 
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With /o := d — 2w(l—w/n), we have P — f y 0. If w < n/2, the maximum over [0, w} 2 of u + v — 2uv/n 
equals 2w(l — w/n), and is attained for u = v = w. Thus P(u,v,t) < for (u,v,t) G f2(n, d, w), and 
u, 0) < d. Thus we obtain that if w < n/2 and d > 2w(l — w/n), then 

L(n,d,w)< f --. (11) 

a — 2w(l — w/n) 

It is unclear in general how to design a good polynomial P of degree k. A possible strategy is to start 
from a polynomial L(t) optimizing the bound for A(n,d) and disturb it with a polynomial p(u,v), i.e. 
take P = L(t) + p(u,v). Since L(t) y 0, condition (1) of Theorem [2l is equivalent to F — foE y 0. 
In order to fulfill condition (2), it is enough to have p(u,v) < for [u,v] E [0,w] 2 so one can take 
p(u,v) = (u + v — 2w)s(u, v) or p(u,v) = (u(u — w) +v(v — w))s(u, v) where s(u,v) is a sum of squares. 
For the degree 1, if one follows this line and takes P = (d — t) + X(u + v — 2w) with A > 0, one finds 
that the optimal choice of A is A = 1 — 2w/n and obtains again the Elias bound (fTTI) . For the degree 2, 
we consider accordingly a polynomial P of the form 

P = (t- d)(t - n) + X(u(u -w) + v(v - w)), 

with A > 0. The matrix F (A) associated to P is equal to 

(nd —n — d — Aw 1 + A 

4n/(n - 1) + 2d/n -4/(n - 1) 
4/(n(n-l)) y 

Let /o(A) := det(F (A). The lower left 2x2 corner of F (X) is positive semidefinite so the matrix 
Fo(X) — f E is positive semidefinite if and only if its determinant is non negative, which amounts to the 
condition 

n 2 (n - 1) 
/o < 8d /o(A). 

On the other hand 

P(w, it, 0) = dn + 2Xu(u — w) < dn 

so we obtain the bound 8d 2 /((n— 1)/ (A)). It remains to find the maximum of /o(A), which is a polynomial 
of degree 2 in A: 

^^/o(A) = -((n - l)d + 2(n - w) 2 )X 2 
+d{2n + 2 - 4w)A + d(2d - (n - 1)). 

The maximum is attained for A = d(n + 1 — 2w) / ((n — l)d + 2(n — w) 2 ), A > if w < (n + l)/2, and 
is equal to 

Ad ^p + 2(n- W )(n+l-2 W ) d _ ^ _ ^ 

n((n - l)d + 2(n - w) 2 ) ' 
This last value is positive if and only if 

(n — w) 



d > 



[n 



7jj(<s/2(n - w){n - 1) - (n + 1 - w)) . 



Altogether we obtain: 

Theorem 3: Assume w < (n + l)/2 and 



(n — w 

d > 



{ j—^{^2(n-w)(n-l)-(n+l-w)). 
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TABLE IV: d = 4 



n\w 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 




A(n,4) < 


10 


31 


37 




















40 


11 


42* 


67 




















72 


12 


56* 


100 


138 


















144 


13 


72* 


144* 


221 


248 
















256 


14 


92* 


201* 


340 


411 


486 


503 












512 


15 


114* 


274* 


508 


750 


849 


989 


1002 










1024 


16 


141* 


365* 


736 


1184 


1571 


1767 


1984 


2012 








2048 


17 


171 


477 


1039 


1813 


2602 


2981 












3276 


18 


205 


613 


1437 


2703 


4183 


5041 


6007 


6324 








6552 


19 


243 


776 


1947 


3933 


6541 


9174 


10532 


12249 


12641 






13104 


20 


286 


970 


2594 


5600 


9976 


14966 


19390 


21965 


24834 


25388 




26168 



Then 



L(n, d, w) < 



2d(d+^f 



79 i 2(n—w)(n+l—2w) j / \o' 

gt + '-^ '-a — in — wY 



Example: with the above we obtain L(n,n/2,n/2) < 2n — 1. It is an almost sharp bound in view of 
A(n,n/2,n/2) = 2n — 2 for values of n for which an Hadamard matrix of order n exists [6, Theorem 
10]. Note that adding the all zero codeword to such an Hadamard code yields L(n, n/2, n/2) — 2n — 1. 
Example: For d = 2w(l — w/n) the degree 1 bound does not apply. The degree 2 gives a bound if 
w > n/2 — ^Jn 2 /(2(n + 1)) which equals 

2w(n 2 — w) 



-(n + l)(w-f)' 



B. Tables 

The tables IV, V and VI give upper bounds of L(n,d,w) employing the SDP method. They always 
improve on the bound © (Cf right most column) and sometimes on © when the latter is stronger than 
the former. This situation is indicated by a star exponent. 

In some cases they allow us to derive exact values of L(n, d, w) by using the expurgation technique 
of the next section. These cases are indicated by bold face numbers. To do that we collect the weight 
enumerators of some special binary codes in the notation of |fT6l . 

The weight enumerator of the RM(2,A) dual of the i?M(l,4) is computed by MacWilliams transform 
OH Ch. 5, Th. 1] as 

x 16 + y w + 140(x 12 y 4 + x 4 y 12 ) + 448(x 10 y 6 + x 6 y 10 ) 
+870xV. 

This shows by expurgation that 

L(16,4,4) = 141. 

The weight enumerator of the Nordstrom Robinson code is 

x 16 + y 16 + 112(z 1 V + A 10 ) + 30xV- 

This shows by expurgation 

L(16, 6, 6) = 113, L(16, 6, 10) = 255. 
The weight enumerator of the extended Golay code is 

x 24 + y M + 759(x w y 8 + x 8 y w ) + 2576x 12 y 12 . 



11 



TABLE V: d = 6 



n\w 


6 


7 


8 


9 


10 


11 




A(n,6) < 


14 


51 


56 


63 










64 


15 


74 


96 


113 


127 








128 


16 


113 


157 


207 


228 


255 


255 




256 


17 


159 


250 


318 










340 


18 


205 


409 


481 


563 


677 






680 


19 


259 


554 


752 


913 


1107 






1280 


20 


324 


739 


1200 


1519 


1835 


2096 




2372 



TABLE VI: d = 8 



n\w 


8 


9 


10 


11 


12 


13 


14 


15 


16 




A(n,S) < 


18 


67 




















72 


19 


100 


123 


137 
















142 


20 


154 


222 


253 
















256 


21 


245 


359 


465 
















512 


22 


349 


598 


759 


870 


967 


990 


1023 








1024 


23 


507 


831 


1112 


1541 


1800 


1843 


1936 


2047 


2048 




2048 


24 


760 


1161 


1641 


2419 


3336 


3439 


3711 


3933 


4095 




4096 



Shortening we obtained the dual of the perfect Golay code. 

x 23 + 506x 15 y 8 + 1288X 11 ?/ 12 + 253x 7 y w . 

This shows by expurgation 

L(24, 8, 8) = 760, L(24, 8, 12) = 3336, L(24, 8, 16) = 4095, 

and 

L(23, 8, 8) = 507, L(23, 8, 16) = 2048. 
V. Constructions 

Three well studied code construction techniques are expurgation, translation, and concatenation. In the 
context of heavy weight codes, the first is perhaps mostly of theoretical interest as a good decoding 
algorithm needs not, in general, provide a good decoding algorithm for a subcode. In contrast, the other 
two techniques also provide practical decoding algorithms. 

A. Expurgation 

The following result shows that, for w < d, B(n, d, w) and A(n, d) are essentially the same (recall that 

B(n,d,w) < A(n,d)). 

Proposition 6: For 1 < w < d < n, we have 

B{n,d,w) > A{n,d) - 1. 

Proof: Let C be a code achieving A(n, d). By first translating this code so that to include the all-zero 
codeword, then by removing the all-zero codeword, we get a new code of size A(n, d) — 1, with minimum 
distance and weight both at least equal to d. The proposition follows. ■ 
Theorem 4: For all large enough and even n, all w < n/2, and all d < nh' 1 (1/2)B 

we have 

B(n,d,w) > 2 (n ~ 2)/2 . 

3 /i _1 ( ) denotes the inverse function of the binary entropy over the range [0, 1/2]. 
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Proof: Pick a self dual code above the Gilbert bound IfTTl . This code being binary self-dual, contains 
the all-one codeword, and is therefore self-complementary. Hence, half of its codewords at least have 
weight at least n/2. ■ 

B. Translation 

We assume that the reader has some familiarity with the covering radius concept 0J31- Recall that the 
covering radius of a code is the smallest integer t such that Hamming balls of radius t centered on the 
codewords cover the ambient space. Define R(n, d) as the largest covering radius of a code achieving 
A(n,d). Since the covering radius exceeds [(d — 1)/2J , we get R(n,d) > [(d — 1)/2J with equality iff 
the code that achieves R(n, d) is perfect. A sharper bound on R(n, d) for non perfect codes is obtained 
as a direct consequence of the sphere covering bound 

R(n,d) 

2 n <A(n,d) 

i=0 

The motivation for taking "largest" rather than "smallest" in the definition of R(n, d) is to have the 
best upper bound on w in the next Proposition, which sharpens, in certain cases, Proposition [6j 
Proposition 7: Fix two integers n > 1 and d > 1. If w < R(n, d) then 

B(n, d, w) = A(n, d). 

Proof: Pick a code C realizing A(n,d). There exists a translate of C of weight w as long as w is 
less than or equal to the covering radius of C. This gives B(n, d, w) > A(n, d). The reverse inequality is 
©. ■ 

C. Concatenation 

Consider an heavy weight code of length n, size q, minimum weight w, and distance d. If we concatenate 
this code with a code of length N, size M, and minimum distance D over GF(q), we get a binary code 
of length Nn, weight at least wN, size M and minimum distance dD. Hence, provided B(n, d, w) > q, 
we see that 

B(Nn, dD, wN) > A q (N, D) . 

where A q (N,D) denotes the largest size of a code of length N and minimum distance D, over GF(q). 
Efficient decoding algorithms for concatenated codes can be found in lfT3l . 

VI. Concluding remarks 

We investigated B(n,d,w), defined as the largest number of codewords of weight at least w and 
minimum distance d. The asymptotic exponent of B(n, d, w) is reduced to those of A(n, d) or A(n, d, w), 
depending on w. For finite values of the parameters, we obtained bounds on B(n,d,w) partly using the 
SDP method. As future research, it might be possible to find new exact values of B(n,d,w) by special 
constructions. In this direction, one possibility is to investigate R(n,d) defined in Section fVl 
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